CDS

Accession Number TCMCG024C33578
gbkey CDS
Protein Id XP_022011015.1
Location join(12803742..12803912,12805360..12805515,12806135..12806280,12806374..12806747,12806917..12807032,12807849..12807992,12808093..12808341,12808425..12808616,12808706..12808795)
Gene LOC110910724
GeneID 110910724
Organism Helianthus annuus
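
Note: the Location field above can be checked against the protein length reported in this record. The following is a minimal Python sketch, not part of the source record. It assumes the location is a plain forward-strand join(...) of 1-based inclusive start..end ranges, with no complement() wrapper or partial markers; the helper names parse_join and spliced_length are hypothetical.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "join(12803742..12803912,12805360..12805515,12806135..12806280,"
    "12806374..12806747,12806917..12807032,12807849..12807992,"
    "12808093..12808341,12808425..12808616,12808706..12808795)"
)


def parse_join(location):
    """Return (start, end) pairs, 1-based inclusive, from a join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Total number of nucleotides covered by the segments."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
cds_length = spliced_length(segments)

print(len(segments), "segments,", cds_length, "nt")
# A complete CDS should hold one codon per residue plus one stop codon.
print("matches 545 aa + stop:", cds_length == 3 * (545 + 1))
```

The nine segments sum to 1638 nt, which equals 3 x (545 + 1), so the location is consistent with the 545 aa length listed under Protein.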

Protein

Length 545 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022155323.2
Definition transcription initiation factor IIF subunit alpha [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category K
Description Transcription initiation factor IIF subunit
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko00001
ko03021
KEGG_ko ko:K03138
EC -
KEGG_Pathway ko03022
map03022
GOs GO:0003674
GO:0005488
GO:0005515
GO:0008022

Sequence

CDS:  
ATGTCGTTTGATTTGGTGCTGAATCCGTCGTGTGACGGTTGCAGATCGACGGTGGAGTTGTATGGTAGCAATTGTAAACACATGACGCTTTGTGTGGCTTGTGGCAAGACCATGGCTGAGCGCAAGGACCGGTGCCGTGATTGTGGCGCCACCATCACTCGTTTAATTAGGGAATACAATGTGCGTGCAAGTGCAGCCAGCGATAAGAATTACTTCATAGGAAGGTTTGTGACTGGTTTACCAAGTTTCTCAAAGAAGAAAAATGACAACAAGTGGTCTCTCCAGAAAGAAGGATTACAGGGACGTCAAGTCAGCGACACCCTACGGGAGAAATACAAGAACAAACCTTGGCTTTTGGAAGATGAAACTGGGCAATTTCAGTATCAGGGTGTACTCGAGGGTGCACAAACGGCAACATACTACCTTCTCATGCTGCAGGGGAAGGAGTTTGTTGCGATCCCTGCTGGTTCATGGTACAACTTCAACAAAGTTGCACAGTATAAGCAGCTTACGCTTGAGGAAGCAGAAGAAAAGATTAAAAACAGAAGGAAAACTGCCGATGGTTATCAGAGATGGATGATGAAAGCAGCAAACGGCGGAGCTGCTGCTTTTGGTGAAGTTGAAAGGTTTGATGACAAGGAAAGCGGTGGAGCGGGTGGGAGAGGGGGGCGTAAAAAGAATAACGCCGATGATGACGAGGCAAACGTGTCAGACCGGGGAGAAGAAGATGAAGACGAGGAGTCTGCTAGGAAAACGAGACTTGGACTTAATAAGAGAGGTGGCGATGATGATGAGGAAGGTCCTAGAGGCGGTGATCTTGATGGTGATGATGATGACATTGAGAAGGGTGATGACTGGGAGCATGAGGAAATTTTCACAGATGACGATGAAGGTGTGGCCAACGATCCCGAGGAACGGGAAGATTTGGCCCCTGAAATCCCTGCTCCTCCAGAAATCAAGCAGGATGACGAAGATGAGGAGGATAATGAAGAAGAGGAAGGAGGACTGAGCCAATCTGGAAAAGAGTTGAAGAAGCTGCTCGGGAAAAATAGTGGGGCCAATGAATCCGAGCCAGAGCAAGAGGATGATGACGATGACGACGATGATATTGAAGACGAAAGTTCCCCTGTTCTTGCACCAAAGGCTAATAATAACGGGGGTCCATCAAAGCGTAATAACAACCCTCTTAAAGAGGAACCCGTCGACAATAGCCCCTCAAAGCCAGCAGCTGCTACAACATCAGCTCGGGGAACCCCATCTTCAAACAAGTCGGCTAAGGGAAAGCGAAAAAGCACTGAAGAGAACAAACCGTCAAATGGTGCCGCCACAGCTTCAAAGAAAGTCAAAACCGAAAATGACGTGAAACATGTAAAGGAGGAGCCTGCAAAGGCCGGTAAAGGTTCTTCTTCTAAACCAGCAGGTGCAGGTGCCTCATCTGCTACCGGACCTGTCACGGAAGAAGAAATTTCGGCGGTTCTGCTGCACAACGCACCTGTCACCACACAGGATCTTGTTGCTAAGTTTAAATCCCGCTTACGCACCAAAGAGGACAAAAATGCGTTTGCAGAAATTCTGAGAAGGATTTCCAAGATACAGAAGACCAACGGTGCCAACTATGTGGTGCTGAGAGACCGATGA
Protein:  
MSFDLVLNPSCDGCRSTVELYGSNCKHMTLCVACGKTMAERKDRCRDCGATITRLIREYNVRASAASDKNYFIGRFVTGLPSFSKKKNDNKWSLQKEGLQGRQVSDTLREKYKNKPWLLEDETGQFQYQGVLEGAQTATYYLLMLQGKEFVAIPAGSWYNFNKVAQYKQLTLEEAEEKIKNRRKTADGYQRWMMKAANGGAAAFGEVERFDDKESGGAGGRGGRKKNNADDDEANVSDRGEEDEDEESARKTRLGLNKRGGDDDEEGPRGGDLDGDDDDIEKGDDWEHEEIFTDDDEGVANDPEEREDLAPEIPAPPEIKQDDEDEEDNEEEEGGLSQSGKELKKLLGKNSGANESEPEQEDDDDDDDDIEDESSPVLAPKANNNGGPSKRNNNPLKEEPVDNSPSKPAAATTSARGTPSSNKSAKGKRKSTEENKPSNGAATASKKVKTENDVKHVKEEPAKAGKGSSSKPAGAGASSATGPVTEEEISAVLLHNAPVTTQDLVAKFKSRLRTKEDKNAFAEILRRISKIQKTNGANYVVLRDR
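
Note: the CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal Python sketch, not part of the source record. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name translate is hypothetical. Only the first 27 and last 18 nucleotides of the CDS are used here to keep the example short; the full sequence can be passed in the same way.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 27 nt and last 18 nt of the CDS listed in this record.
cds_start = "ATGTCGTTTGATTTGGTGCTGAATCCG"
cds_end = "GTGCTGAGAGACCGATGA"

n_term = translate(cds_start)
c_term = translate(cds_end)
print(n_term)
print(c_term)
```

The two fragments translate to MSFDLVLNP and VLRDR, which match the first nine and last five residues of the protein sequence listed above; the final TGA codon is the stop.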